Estimating evolution of freshness in Internet cache directories under the capture-recapture methodology
نویسندگان
چکیده
1 Abstract— In this paper, we describe a new web sampling schema for measuring the evolution of freshness in search engines. The methodology used is the capture-recapture, which is mainly applied for estimating evolution rates in wildlife biological studies. After modifications and amendments necessary for web paradigm application, we conducted three capture-recapture experiments of different duration over the caches of Google and MSN. In parallel, we used a typical sampling scheme, similar to many other web sampling approaches used in the literature, in order to evaluate the robustness of our proposal. The paper provides the implementation details of a web-based capture-recapture model along with its assessment. The results show that through the capture-recapture methodology we are able not only to measure the freshness of the tested search services but also to monitor its evolution over time, with a substantially lower amount of required sampling instances. It was not our intention to compare the performance of Google and MSN. However, through our experiments, we observed that although one sometimes presents better refresh rates than the other, in general both search services have virtually equal capabilities in refreshing their directories and providing new and up-to-date results to their users.
منابع مشابه
Semi-automatic e-chartering through multi-agent systems and satellite IP networks
Scholarly Contributions [Data Provided by ] Editorial on "qoS and service provisioning for integrated wireless networks" Towards a collaborative ranking mechanism for efficient and personalized internet search service provisioning On the feasibility of applying capturerecapture experiments for web evolution estimations Design and implementation of a VoiceXML-driven wiki application for assistiv...
متن کاملEstimating the size and evolution of categorised topics in web directories
In this paper a statistical approach for estimating the evolution of categorized web page populations in web directories is proposed. The proposal is based on the capture-recapture method used in wildlife biological studies and it is modified according to the necessary assumptions and amendments for conducting the experiments on the web. During these experiments, web pages are likened to animal...
متن کاملEstimation of Maternal Mortality Rate in Iran from 2010 to 2014 Using Capture-Recapture Method
Estimation of Maternal Mortality Rate in Iran from 2010 to 2014 Using Capture-Recapture Method Ayat Ahmadi 1, Bahareh Yazdizadeh 2, Alireza Zemestani 3* 1Assistant professor of Epidemiology, Knowledge Utilization Research Center, Tehran University of Medical Sciences, Tehran, Iran 2Associate professor of Epidemiology, Knowledge Utilization Research Center, Tehran University of Medical Science...
متن کاملEstimation of Road Traffic Mortality in Kurdistan Province, Iran, During 2004-2009, Using Capture-Recapture Method
Background: To reduce traffic injuries in the country, health professionals should have accurate estimates of road traffic deaths. Multiple and sometimes inconsistent statistics presented by organizations in charge create high degree of uncertainty for planners and decision makers. To achieve an accurate estimate, several methods are available. Of them, capture-recapture method ...
متن کاملA comparison of linear transect and capture recapture methods results in Iranian Jerboa population density and abundance estimation in Mirabad plains, Shahreza
During a period from spring 2008 till fall 2010, Iranian Jerboa population abundance was estimated using distance (linear transect) and capture-recapture methods in the Mirabad plains near Shahreza city in Isfahan Province. In the study period, during the active time of the species except reproduction time, we tried to live-trap, mark, release and recapture individuals based on Schnabel method ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Networks
دوره 54 شماره
صفحات -
تاریخ انتشار 2010